Bulk Insertion for R-Tree by Seeded Clustering

نویسندگان

  • Taewon Lee
  • Bongki Moon
  • Sukho Lee
چکیده

In many scienti c and commercial applications such as Earth Observation System (EOSDIS) and mobile phone services tracking a large number of clients, it is a daunting task to archive and index ever increasing volume of complex data that are continuously added to databases. To eÆciently manage multidimensional data in scienti c and data warehousing environments, R-tree based index structures have been widely used. In this paper, we propose a scalable technique called Seeded Clustering that allows us to maintain R-tree indexes by bulk insertion while keeping pace with high data arrival rates. Our approach uses a seed tree, which is copied from the top k levels of a target R-tree, to classify input data objects into clusters. We then build an R-tree for each of the clusters and insert the input R-trees into the target R-tree in bulk one at a time. We present detailed algorithms for the seeded clustering and bulk insertion as well as the results from our extensive experimental study. The experimental results show that the bulk insertion by seeded clustering outperforms the previously known methods in terms of insertion cost and the quality of target R-trees measured by their query performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bulk insertion for R-trees by seeded clustering

We propose a scalable technique called Seeded Clustering that allows us to maintain R-tree indices by bulk insertion while keeping pace with high data arrival rates. Our approach uses a seed tree, which is copied from the top k levels of a target R-tree, to classify input data objects into clusters. We then build an Rtree for each of the clusters and insert the input R-trees into the target R-t...

متن کامل

Two-Phased Bulk Insertion by Seeded Clustering for R-Trees

With great advances in the mobile technology and wireless communications, users expect to be online anytime anywhere. However, due to the high cost of being online, applications are still implemented as partially connected to the server. In many data-intensive mobile client/server frameworks, it is a daunting task to archive and index such a mass volume of complex data that are continuously add...

متن کامل

GBI: A Generalized R-Tree Bulk-Insertion Strategy

A lot of recent work has studied strategies related to bulk loading of large data sets into multidimensional index structures In this paper we address the problem of bulk insertions into existing index struc tures with particular focus on R trees which are an important class of index structures used widely in commercial database systems We pro pose a new technique which as opposed to the curren...

متن کامل

Bulk Insertions into R-Trees

A lot of recent work has focussed on bulk loading of data into multidimensional index structures in order to eeciently construct such structures for large datasets. Previous work on bulk loading data focussed at building index structures from scratch, while the problem of bulk insertions into existing index structures has been largely overlooked. In this paper, we address this new problem with ...

متن کامل

Advanced concepts and applications of the UB-tree

The UB-Tree is an index structure for multidimensional point data. By name, it claims to be universal, but this imposes a huge burden, as there are few things which really prove to be universal. This thesis takes a closer look at aspects where the UB-Tree is not universal at a first glance. The first aspect is the discussion of space filling curves (SFC), in particular comparing the Z-curve and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003